H gen#me/^ 

Joumals.ASM.org 

Complete Genome Sequence of Sfll, a Serotype-Converting 
Bacteriophage of the Highly Prevalent Shigella flexneri Serotype 2a 

Divya T. George,^ David P. Stephenson,^ Elizabeth Iran,'' Renato Morona,'' Naresh K. Verma^ 

Division of Biomedical Science and Biochemistry, Research School of Biology, the Australian National University, Canberra, Australian Capital Territory, Australia^; School of 
Molecular and Biomedical Science, the University of Adelaide, Adelaide, South Australia, Australia^ 

Sfll is a serotype-converting temperate bacteriophage of the highly prevalent Shigella flexneri serotype 2a. We isolated the Sfll 
phage from a wild-type strain of S. flexneri serotype 2a. Here, we present the complete genome sequence of this phage. 
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Shigella flexneri strains are most frequently linked with out- 
breaks of shigellosis, and these strains are susceptible to infec- 
tion by temperate lambdoid bacteriophages. Here, we isolated and 
completely sequenced bacteriophage Sfll, induced from a virulent 
S. flexneri 2a serotype, strain NCTC 4. 5. flexneri 2a is the most 
widespread serotype associated with shigellosis cases (1). 

Electron microscopy data (2) show that Sfll morphologically re- 
sembles bacteriophages belonging to group Al, from the family Myo- 
viridae and the order Caudovirales (3). Sfll confers its host with the 
serotype-converting O-antigen-modifying glucosyltransferase {gtr) 
genes. Serotype conversion is a key defense mechanism used by 
S. flexneri to evade host defense responses (4). Not much is known 
about temperate bacteriophages of S. flexneri outside their role in 
serotype conversion. 

To further our understanding of 5. flexneri phages, the com- 
plete sequence of Sfll was determined. Eight PstI fragments gen- 
erated from the Sfll genome were cloned individually into a 
pBlueScript vector, and both strands of each fragment were then 
sequenced using Sanger sequencing. Primer walking was used to 
fill in gaps and determine the order of the fragments. The DNA 
sequences were assembled into contigs using BioEdit (5). Putative 
open reading frames (ORFs) were identified using the NCBI ORE 
Finder and CLC Main Workbench version 6.5. 

The genome of Sfll is linear, double-stranded, and is 41,475 bp 
with an average G+C content of 49.17%, corresponding to 58 
coding sequences (CDSs). Most of the genome (71.57%) is pre- 
dicted to be transcribed from the sense strand, while 19.14% of the 
genome, including the gtr cluster, is predicted to be transcribed by 
the antisense strand. In database searches of the 58 predicted OREs 
using NCBI BLASTx, 41 were assigned functions and 17 were 
identified as hypothetical proteins. 

Nucleotide and protein homology searches with other pub- 
lished S. flexneri phage genomes indicate that Sfll, like SfV (6), Sfl 
(7), and Sf6 (8), is a member of the temperate lambdoid group of 
bacteriophages with their conserved arrangements of early and 
late genes. A comparison of phage terminases found that the pack- 
aging mechanism used by Sfll is similar to those of SfV and Sfl, 
with cohesive end {cos) sites spanning bp 59 to 125, adjacent to the 


terminases. Although Sfll contains features of SfV^ and Sfl, it has a 
unique host range. While SfV was capable of infecting 7 of the 12 
S. flexneri serotypes tested (serotypes la, lb, 2a, 2b, 3b, 4b, and Y), 
Sfll and Sfl only infected 3 (serotypes 3b, 5a, and Y) and 2 (sero- 
types X and Y) of the 12 serotypes tested, respectively. 

The main differences between S. flexneri Sfll, SfV, and Sfl lie in 
their immunity, replication, and Nin regions. Our analysis found 
the following genes to be unique to Sfll: Sfll_25 (acyltransferase), 
Sfll_27 {gtrll), Sfll_26, Sfll_55, and Sfll_56 (transposases), 
Sfll_35, Sfll_36, Sfll_40, Sfll_41, Sfll_49, Sfll_ 50, and Sfll_57 
(hypothetical proteins), and Sfll_48 (antiterminator Q). It will be 
interesting to study the hypothetical proteins that are unique to 
Sfll in the context of Shigella pathogenesis. 

Nucleotide sequence accession number. The complete ge- 
nome sequence of bacteriophage Sfll has been deposited in Gen- 
Bank under the accession no. KC736978. 
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